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ABSTRACT 

Based on the comparison of a regular-sized (n=25) and 
an oversized (n=54) class of college students taught by the same 
instructor, significant superior performance was found in favor of 
the large class. This disparity persisted even when a switching 
regression model (with endogenous swi tching) • was performed. Only 397o 
of the learning disparity can be explained by characteristics of the 
student, including a 29.5% performance gap that is attributed to the 
potential "overachiever " trait of small class students. In addition, 
it was found that overall grade point average and performances in 
previous collide economics courses did not have any significant 
impact on the students' current scores. Contrary to most of the 
findings in recent literature, high school economic education did not 
have any positive impact on the students' performance. In fact, it 
had a slightly negative effect. No evidence of gender difference was 
found for the large class, but a fairly .significant lower performance 
was found for females in the small class. The student questionnaire 
is attached. Three tables are included. (Contains 28 references.) 
(Author/SLD) 
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Abstract 

Based on the comparison of a regular sized class and an oversized 
one, significant superior performance was found in favor of the large 
class. This disparity persisted even when a switching- regressions model 
(with endogenous switching) is performed. Only 39% of the learning 
disparity can be explained by characteristics of the student, including 
a 29.5% performance gap that is attributed to the potential 
"overachiever" trait of small class students. 

In addition, it is found that overall GPA, performances in previous 
college economics courses do not have any significant impact on the 
students' current score. What is more interesting is that, contrary to 
most of the findings in recent literature, high school economic 
education does rot have any positive impact of the students' 
performance. In fact, it has a slightly negative effect, No evidence 
of gender difference is found for the large class, but a fairly 
significant lower performance is found for females in the small class. 



Class Size and Determinants of Learning Effectiveness 

By 

Jack W. Hou* 



A gradual increase in class size has been a national trend since the mid 
1980s. The reason is obvious. Universities are facing budget cuts, while 
departments are trying to retain or increase enrollments to fight for 
operational funds and the ability to offset courses with lower enrollments. 
These, among others, certainly signals that oversized sections will be a 
fixture of academic life for at least the foreseeable future. 

Economics aside, the concern we should have as educators is how this will 
affect the students' learning? This study is an attempt to evaluate this 
issue. It is surprising that direct measures of potential student learning 
differences between heterogeneous class sizes are conspicuously lacking, at 
least regarding college economics. Though this study is a comparison between 
two upper division economics classes, identical except for the clas;: size, the 
experiment can easily be applied to any economics courses, and to other 
disciplines for that matter. 

The objective of this study is to use a technique originally designed for 
discrimination analysis to decompose the measured learning difference (in 
terms of test scores) into a portion that is accountable by the differences in 
student characteristics and the residual, which can be interpreted as due to 
the difference in class size. For a more detailed discussion of this 
methodology, refer to Hou (1991, 1993ab) . 

* Associate Professor, Department of Economics, California State University, 
Long Beach. Partial support was provided by the Institute for Teaching and 
Learning, California State University System. I am grateful to Wayne 
Phillips for his help in providing some of the students background 
information. Sharon Yamada provided excell-ent assistant in terms of data 
entry. The usual disclaimer applies. 



The size and direction of this residual will have significant impact on the 
direction of the classroom structure. If there is no difference in learnirg 
(or even in favor of larger classes), simple economics should dictate the 
gradual conversion towards larger class sizes , as the only cost is perhaps 
building bigger rooms while the benefit would be obvious. If larger class 
size does hamper the student's learning, but only minutely, one may still be 
able to justify the larger class size in terms of its economies of scale. 
However, if the negative effect on the students' learning is significant, a 
tough choice will- have to be made in this time of budget difficulties. 

The remainder of the paper is organized as followed. I will open with a 
brief description of the data. Section II will outline the methodology and 
the model. This is followed by the empirical estimates and discussion, A 
brief summary will conclude the paper. 

I. Data Source 

Ideally, we would like to measure the effect of class size not just in the 
contemporary disparity in learning, but also the difference in the half- life 
of the knowledge between students from the two classes. = However, clear and 
direct measures are hard to get, if they exist at all, especially regarding 
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the half -life of learning. = 

For realistic reasons, this study is based on the data collected from two 
classes (Managerial Economics or ECON333) in which I was the instructor: one 
was an "oversized" section, while the other was a normal section. This offers 
an excellent setting for examining the issues raised above. Since I am 
teaching both classes (back-to-back in the same semester), the usual problem 
of instructional heterogeneity is negligible. To further minimized any 



3 

differences, I consciously attempted to make the material and presentation as 
homogeneous as possible. 

A survey questionnaire (Appendix) was developed and implemented in both 

_ 3 

classes. = This generated detailed data regarding various relevant 
characteristics (both personal and family background) of the student. With 
the students' authorization, and the aid of the University Registrar, various 
information, such as current and high school GPA, SAT scores , etc. , were 
extracted from the student files. Combined, these information formed the data 
base for the empirical study. 

For the measurement of the students' learning, several quizzes were made 
identical between the two classes. In addition, one section of each midterm 
and of the final exam were also made identical between the two classes. 
These, combined with the fact that the lectures were maintained as homogeneous 
as possible between the two classes, allows for a meaningful comparison of the 
students ' learning . 

There are a total of six common quizzes and six common exam questions. A 

two-sample t test was calculated based on the following test statistics 

_ Y L - Y s 

[5x(l/n L +l/ns)] 

where _ _ 

Yl, Y s •= observed average grade of the large (L) and small (S) class 

2 2 

_ (n L -l)cr L + (n s -l)g- s 
r\i + n s - 2 

t^l, n s - number of students in the large and small class 
This statistic has a student t distribution with ni/fn s -'2 degrees of freedom. 
For the twelve pair of comparison, the large class out-performed their small 
class counterpart in only two, and neither was statistically significant. In 
the ten pairs where the small class scored better, six of them were 
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statistically significant at the (2-tailed) 5% level. Out of these, three 
were significant at the 1% level. The "observed" superior performance of the 
smaller class is evident. However, as will be explained in the next section, 
this may be quite misleading. 

The common quizzes and exam questions were next standardized separately, « 
added up according to the weights indicated in the class syllabus , and the 
final score is standardized again.. Based on this procedure, for the combined 
sample, the mean or average is zero with a standard deviation of one. 
However, for the two classes separately, the difference is apparent. For the 
small/regular class, the mean is 0.40, as compared to the -0.18 for the over- 
sized class. The comparison for the standard deviation is 0.85 versus 1.02. 
Thus confirming intuition that students in the small class perform better and 
are more consistent. Based on the two-sample t test described above, the 
difference in the mean is significant at the 1% level. 

A summary of several other key variables are presented in Table 1. As can 
be clearly seen, many characteristic or "endowment" differences exist between 
the two classes. In general, students in the small class are more likely to 
be males, Asians, those with betters grades (in terms of GPA, and performance 
in Principles of Microeconomics and Calculus), and transfers from either 
Junior College or other universities . This perhaps foresee the problem of 
selection bias that may hamper OLS estimates, = which will be discussed below. 

An interesting point is observed if one contrasts the student self-reported 
GPA versus the GPA provided by the Registrar. The former is consistently 
higher than the latter. The two-sample t test between all three pairs of 
GPA(R) and GPA(A) indicate that the two measures are statistically different 
(at the 1% level) for all pairs. In other words, the student's "error" in 
reporting their GPA cannot be explained as "rounding error". 
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What is surprising is the high percentage (close to 80%) of students 
holding at least part-time jobs. Of these students, they work an average of 
25 hours/week. This perhaps reflects both the nature of the institution and 
the difficult economic times we are in. Another startling statistics is the 
low percentage of students that started their undergraduate education at the 
University. Close to three-quarters are transfers either from JC (Junior 
College) or other four year universities. From a Departmental point of view, 
this certainly deals a heavy blow to the enrollment of lower division classes, 
and eventually affects FTE (Full-Time-Equivalent) and funding. 

A word of caution. Due to the sample size, the empirical estimates and 
implications should be taken with a grain of salt as their robustness may be 
questionable. The main objective here is to initiate interest in addressing 
this question of the effects of class size on student learning, and to 
demonstrate the importance of using the correct methodology. 

II • Methodology 

The most intuitive, and often used measure, is the direct comparison of the 
observed average grade between two classes based on some uniform exam. This 
is crude and can be misleading as it ignores the differences in student 
composition (what shall be termed as characteristic "endowments") between the 
classes. Thus, the traditional method has been to estimate the following 
regression 

(1) Yi = pXi + 7 Di + «i 

where Yi = standardized test score of student i 

Xi « a set of explanatory variables including the students personal/ 
family characteristics, GPA/SAT scores, etc. 
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Di « binary class size variable (1 for small class, 0 otherwise) 
ei - random error term 

The "/3" coefficients reveal what determines the students learning potential, 

while the "7" coefficient will indicate the effect of class size. 

Though an improvement over the direct comparison of average performance, 
this measure of class size effect is crude/biased, and potentially erroneous. 
The reason is tri-folded. First, the exams/quizzes are different between 
classes. Second, The effects of "Xi" attributes may work differently in a 
small class relative to a large one. And finally, students in the small class 
may come from a different population that those in the large class (i.e. 
certain 11 intangibles" or unobservable attributes). 

To bring the last point sharper into focus, let us assume that those more 
eager to learn (or, "better" and more self-motivated students) "self -select" 
into the small classes, while those that are passive and desire anonymity 
choose the large classes. Further, suppose the small class is observed to 
out-perform the large class. This may erroneous lead one to the conclusion 
that a larger class diminishes the students' learning potential. while in 
reality, it is possible that the seemingly superior performance of the smaller 
class is simply due to the fact that it is made up of more diligent students, 
and class size really have no significant effect at all. This will be dealt 
with later. 

To compensate for the first two of the aforementioned problems with 
equation (1), separate regressions will be estimated for each class, limiting 
to only portions of the grades that come from identical quizzes and questions 
in exams. This leads to the following two regressions 

(2) Y Si - ftjXsi + esi 

(3) Y Li - /3 L X L1 + 6 Li 

ERLC J 



where the subscript S and L stand for the small (i.e. normal or regular) class 
and the large or oversized class respectively. 

Using a method originally designed for "discrimination" analysis in labor 
economics, one can then estimate the learning differential between the two 
classes. In a nutshell, the method estimates what the small class student's 
measured learning would be if he or she was in the large class. Deducting 
this from the observed learning difference is what I shall term as "class 
size" effect on student's learning. 

More specifically, the "gross" learning differential is defined as G - Y s - 
Y L , where Y s and Y L are the average test scores of the small and the large 
class, respectively. By definition, these average test scores can be 
rewritten into 

Yj « /3jXj j - S, L (j3j is the least squares estimate of £j) 

Further define 

This is the "expected" test score of the average student in the small class if 
he/she chose to attend the large class instead. The "gross" learning gap can 
be decomposed into two parts, 

(4) G « 'Ys - Y L « (Y S - Ysl) + (Y SL - Y L ) 

- (is - /3l)X s + MX S - X L ) - R + E 
where "E" represents the learning differential which is due to, or can be 
"explained" by, differences in characteristics (i.e. "endowments"), while "R" 
is the residual difference which can be viewed as the class size effect. 

However , due to the endogenous nature of the student' s choice between 
attending the small versus the large section, the OLS estimates from the above 
regressions (equations (2) and (3)) will be .biased.- To account for this 

t \) 



8 

endogenous class (size) selection decision, a probit model will be used to 

7 

generate a selection-bias-correction variable in the Heckman-Lee tradition. = 

To motivate the discussion, the students are assumed to be utility 
maximizers. The behavioral assumption is that he/she will choose the section 
that generates higher utility. Student i is assumed to have, an indirect 
utility function of 

Vi = V(Yi, DiBi) 

where 

Yi = learning (in terms of test scores) 
Di - binary class size dummy 

Bt = net non-grade benefits associated with small classes 
Vsi 5=5 V(Ysi, Bi) and V Li = V(Yu, 0) denote the student's utility level if he or 
she chose to be in the small and large class respectively. I shall further 
assume that the student's preference is characterized by a linear, additive 
indirect utility function of the form 

Vi « Yi + DiBi 

Since the maintained behavioral assumption is that students will choose the 
class which generates higher utility, an indicator variable can be defined as 

(5) Ii - V Si - V Li - Y Si + Bi - Y Li 
where 

Ii - comparative index (Ii > 0 implies that the small class is more 
attractive to the student and 1^ < 0 implies the opposite) 

Each student has a potential learning function for both the small and the 

large class, as shown in equations (2) and (3) above, and the net non-grade 

benefits can be written as 

(6) Bi - A>X b i + 6 bi 

Substituting (2), (3), and (6) into equation (5) yields 

(7) Ii « Y Si - Y Li + Bi - /3sX Si - b L X Li + bbX H - rn - bZi - Ql 
where ct j , Xji « as defined in equations (2) and (3) 



b •= vector of coefficients 

Zi «= all variables in X S i, X L i and X bi 

1i - -(«si - «Li + «bi) 

If Ii > 0, the student derives higher utility from the small class, while if 

Ii < 0 then the large class is preferred. The binary class dummy is thus 

r 1 if Ii > 0 

Di = - 

1 0 if L < 0 

Assuming that r li is a standard normal random variable, the probability that 
student i will choose the small class is given as 

Pr(Di - 1) - Prdi > 0) = .Pr(^ < 0Z L ) - *( i 9Z i ) 

where 

«C) = cumulative density function (CDF) of the standard normal 
distribution 

The probability that the student prefers the large class is 
Pr(Di = 0) = Prdi < 0) = Pr(tfi > /? Zi ) = 1 - <J>(/? Zi ) 
The log-likelihood function then can be written as 

(8) * - I Di^nS^Zi)] +1(1- Di)in[l - $(/? Zi ) ] 

The probit model will be estimated by maximum likelihood methods. Based on 
the results, the potential selectivity bias created by the endogeneity of the 
individual's sectoral choice can be corrected. 

The Z variables that characterize the class selection are conceivably the 
same as those that determine their learning effectiveness (i.e. equations (2) 
and (3)). To identify equation (8) from the two learning regressions, the Z 
variables includes a class size "preference" variable. = 

Based on the estimates of the probit function as shown in (8), the 
following variables can be defined to correct for the selectivity bias 

(9) A Si = - an( i „ 



*(£Zi) 1 - $(/?Zi) 
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where 

Asi, A Li « Heckman-Lee type selection bias correction terms (i.e. the 
inverse Mills ratio). 
*(') - the PDF and CDF of the standard normal distribution. 
P, Zi « refer to equation (7) above. 

Equations (2) and (3) can now be rewritten into 

(10) W Si - £sX S i + a s A Si + f S i iff Ii > 0 

(11) w Li « i 9 L x Li + aLAu + iff IjL < o 
where 

£ji - *ji ~ ^j^Aji j - S , L 

a 6 - Cov(€ji, r?i) j m S, L 

OLS will generate consistent estimates of 0 S , Pl, o s and a L since the error 
structures £ S i and £ Li have zero conditional means. « However, the standard 
errors will be biased, since they ignore the fact that the probit coefficients 

A 

P (ML estimates of p) were estimated in the first stage. The standard 
errors will be correct following Lee, Maddala, and Trost (1980). 

This will allow the explicit account for the student's class (size) 
selection choice (often termed in the literature as a "switching regressions 
model with endogenous switching"). The decomposition of the observed learning 
difference shown in equation (4) will result in the "corrected" measure of the 
effect of class size. This will effect will be purged of the intangibles that 
lead to the students selection between the class sizes. 



III. Empirical Findings and Implications 

As discussed in Section I, the grades for the total sample (i.e. both 
classes combined) was constructed such that it has a zero mean with a standard 
deviation of one. However, for the two classes separately, the difference is 
apparent. For the small/regular class, the mean is 0.40, as compared to the - 
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0.18 for the over-sized class. The comparison for the standard deviation is 
0.85 versus 1.02. Based on the two-sample t test described above, the 
difference in the mean is significant at the 1% level. 

This is a crude measure at best and is potentially misleading as , among 
other things, it ignores the differences in the composition of the two 
classes. If this was the only problem, a standard regression analysis 
(equation (1)), with a dummy variable (taking on the value of 1 for students 
in the small class) to capture the effect of the difference in class size, can 
handle it. Column (1) of Table 2 represents such a regression. As can be 
seen, the effect of class size (CS) is insignificant. This seems to suggest 
that after controlling for student characteristics, there is no statistically 
significant learning difference between the two classes. 

This, however, may lead to an erroneous conclusion, as it assumes that the 
only difference in learning between the two classes is a "shift" disparity. 
In other words, characteristics are assumed equally "productive" in generating 
learning in both class settings. This is certainly not what one would 
commonly expect. In statistical terminology, this regression constrained y9 s = 
/3 L (equations (2) and (3)) with the exception of the intercept. 

To allow for "shadow" price differences, equations (2) and (3) are 
estimated separately and presented as columns (2) and (3) in Table 2. 
Clearly, the coefficient estimates are quite different between the two 
samples. Statistically, most of the difference is due to the effects of Load 
(number of units taken), Hispanic ethiniticity (Hisp.), transfer from another 
four year college/university (University), and number of siblings (SIBN) . 

As can be seen, the intercept (the "shift" variable) is not statistically 
different between the two samples. This clearly demonstrates the inadequacy 
of the simple model (shown in column (1)) where one tries to capture the class 
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size effect with a mere binary dummy variable. 

Several interesting points arise from the coefficient estimates of Columns 
(2) and (3) . Somewhat' surprisingly, the overall GPA had no significant 
predictive power on the student's learning. This is in sharp contrast to many 
studies on student performance (e.g. Raimondo , Esposito , and Gershenberg 
1990). However, the student's grade in (Business) Calculus positively 
affected their learning, and was statistically significant for both samples. 
Though Principles of Microeconomics (P. Micro) showed no effect for the small 
class, it was an important determinant for the larger class. These are not 
surprising as Managerial Economics is a calculus-based and micro-oriented 
course . 

The Load variable showed a sharp contrast between the two classes. It had 
no effect in the larger section, but was positive and very significant for the 
small class. This implies, for the students in the small class, the more 
units they are taking the better their performance. Since this is lacking in 
the large class, one may conjecture that the "overachievers" tend to self- 
select themselves into the small class. If this is the case, it indicates a 
potential biases in the OLS estimation (see Section II above). = 

Though marginally significant at best, high-school economics (HSECON) 

exhibited a negative impact on the students' performance. The effect of high 

school economics course on the student's' performance in college has long been 

of interest to economists, and the evidence has been mixed. Palmer, Carliner, 

and Romer (1979) found that students with high-school economics background 

tend to do slightly (and statistically significantly) better in the "pre- 
11 

test" ,= while Reid (1983) found the opposite that they achieve significantly 
lower grades in college level economics. 

More recently, studies by Walstad and Soper (1988), Myatt, and Waddell 
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(1990), Becker, Greene, and Rosen (1990), Brasfield, Harrison, and McCoy 
(1993) all found that high school economics c'oes have a positive effect on the 
students economic knowledge and performance, with Marlin (1991) as perhaps the 
sole exception. The negative effect of high school economics education found 
in this study does seem to be consistent with the earlier findings of Palmer, 
Carliner, and Romer. 

Two (related) reasons may lie behind this. First of all, most of the 
aforementioned studies were regarding the effect on performance in principles 
courses or entry level tests on economic literacy, while the study here is on 
the effect of high school economics courses on the performance of students in 
an upper division economics course. Second, as Becker, Greene, and Rosen 
(1990) pointed out, the positive impact of high school economics may not have 
a lasting effect. The negative contribution of high school economics found 
here may well be due to these reasons. 

Another marginally significant variable is whether the student went to a 
private high school (HSPRI) . It also tends to negatively affect the student's 
performance. This is somewhat puzzling. The cost of private high schools are 
substantially higher than public schools. Granted, there may be prestige or 
safety associated with private schools, the expectation of a superior 
education (at least on average) is also undeniable. Then, why the observed 
negative effect on the student's performance? Without more rigorous testing, 
any statement is mere conjecture. With this in mind, one possibility could 
be that the upper end of the distribution of private high schools tend to 
enter the UC (University of California) system, while the mid to lower end of 
the distribution are sorted into the CSU (California State University) system. 

Another interesting variable is the effect of transfers from other 
universities or four year colleges. It showed a negative effect (though not 
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statistically significant) for the students in the large class, however, for 
the small class, the effect was positive and very significant. This can again 
be viewed as a sign of self -selection. 

Before I turn to the selection bias stemming from this potential self- 
selection, the observed learning or performance differential between the two 
classes will be decomposed according to equation (4) 

(4) G = Y S - Y L = (Y S - Y SL ) + (Y SL - Y L ) « (£ s - £l)X s + £ L (X S - X L ) 

- 0.3973 - (-0.1839) = (0.3973 - [-0.1270]) + (-0.1270 - [-0.1839]) 
Thus, the decomposition become 

0.5812 - G - R + E - 0.5243 .+ 0.0569 
As can be seen, the explainable portion of the difference is very small indeed 
(less than 10%) relative to the residual (more than 90%). This latter 
difference can categorically be attributed to the effect of the class size. 
Compared to the binary dummy approach embodied in column (1) of Table 2, the 
effect of class size is consistent (i.e. smaller class size lead to higher 
learning) . 

However, both the magnitude and the significance level of the learning 
difference is markedly higher once the difference in learning effects of the 
characteristics are allowed to differ between the two classes. In tarms of 
the size of the learning difference, the dummy variable approach lead to an 
estimated gap of 0.2851 (or 49% of the observed difference), while the 
"switching" regressions model led to a much higher 0.5243 (or 90%) learning 
differential content due to class size. Statistically, the dummy variable 
estimate had a mere 1.27 t statistic, implying that the difference is not 
statistically different, even at the 20% significance level, from zero (as in 
no learning difference due to class size). While in contrast, the difference 
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implied in the switching regressions model complied a 13.10 (two sample t 
test) test statistic, which is significant at the 1 % level. 

This result is hardly surprising. Recall, the binary dummy approach 
essentially assumes that the shadow price (or coefficients) for the 
characteristics are the same between the two classes, with the only difference 
in the "shift" or intercept. Casual examination of columns (2) and (3) in 
Table 2, clearly refutes this. The bulk of the difference appears to be in 
the shadow price or slope coefficients, while the intercept is similar 
(statistically insignificantly different) . 

However, due to the possibility of the student's endogenous self-selection 
in terms of what type of class to enroll in, the above results may be biased. 
To account for this self-selection, the probit function as shown in equation 
(8) is estimated via ML methods. The Heckman-Lee selection bias correction 
term (equation (9)) were calculated and incorporated into equations (10) and 
(11). The estimated results are shown in Table 3. 

Column (1) presents the estimates of the probit function (equation (8)). 
As expected (from the examination of Table 2) , students with larger loads tend 
to select the small class (perhaps signifying "overachievers") , and transfers 
from other universities also tend to prefer the small class. Other variables 
that show significant explanatory power in terms of this selection process 
includes the ethnic variable of Hispanics (Hisp.) and the binary dummy for 
Junior College (J.C.) transfers. The former was not all surprising as the OLS 
estimates in Table 2 foresaw this. The latter is somewhat unexpected as the 
percentage of students being J.C. transfers was about the same (Table 1), and 
there was no statistically significant difference between their OLS estimates 
(Table 2), though they were of opposite signs. 

The sole other variable that had marginal predictive power is the order the 
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student ranks among his siblings, i.e. first born, second born, etc. The 
positive coefficient implies that the lower the order in the rank (i.e. third 
born, as compared to say, first born) the more likely the student will select 
the small class. Though one can certainly offer a number of speculation as to 
why we observe this, but they are just that, speculations. 

The identification variable chosen is the student's class size preference. 
It is assumed that this affects their choice between the two classes, but have 
no effect on their learning effectiveness. The log likelihood of the probit 
estimation (column (1) of Table 3) is contrasted with that of a constrained 
version (where all coefficients are forced to zero with the exception of the 
intercept) to test for the significance of the overall model suggested by 
equation (8). This likelihood ratio test resulted in a rejection of the null 
hypothesis (that both versions had statistically similar explanatory power) at 
the 1% significance level. 

The selection bias correction variables (equation (9)) are calculated based 
on the above probit estimates and entered as an additional variable (SBCV) in 
the two learning regressions (equations (10) and (11)). The estimated results 
are shown as columns (2) and (3) in Table 3. Several things stand out. First 
of all, the explanatory power of the regression for t.ie small class improved 
significantly. This can be seen in the higher adjusted R-squared, and the 
increase in the significance level of coefficient estimates. The OLS 
regression had six regressor statistically significant at the 10% or better 
level. These same six regressors are still significant after correcting for 
the selection bias, and at a higher significance level (e.g. Calculus had a 
significant effect at the 5% level in the OLS regression, while the 
significance level is raised to the 1% level here). 

In addition, two previously unimportant variables have emerged with strong 
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predictive powers: Gender and J.C. transfers. The former is of special 
interest, as a sizable literature has accumulated regarding gender difference 
in performance, especially in high school or below. Earlier studies tend to 
show a superior learning of economics by males. Moyer and Paden (1968) and 
Highsmith (1974) found that high school males outperformed their female 
counterparts. Siegfried (1979) surveyed previous studies and concluded gender 
differences in learning and understanding of economics were absent at the 
elementary school level, but by high school a distinct gap appears and 
persists into college. 

However, subsequent studies painted a. somewhat different picture. 
MacDowell, Senn, and Soper (1978) found no evidence of gender difference among 
junior high school economic education. Jackstad and Grootaen (1980) and Hahn 
(1982) found that the gender variable had no predictive power regarding the 
learning of economics among high school students. Both Ferber, Birnbaum, and 
Green (1983) and Lumsden and Scott (1987) found evidence that males outperform 
females in multiple choice exams, while indifferent or inferior to females in 
essay exams. However, Williams, Waldauer, and Duggal (1992) found no 
significant gender difference in college economics performances. 

Much of the above discussion mainly pertain to the regression of the small 
class, a similar pattern is categorically absent for the large class. This is 
consistent with the coefficient of the SBCV, which was much more significant 
for the small class than it was for the large class, indicating a significant 
positive self-selection= among students in the former but lacking in the 
latter class. This finding is not unexpected either, since the large classes 
are almost like defaults, while a student will have to "seek out" a small 
class. In a typical semester, the department offers ten sections of ECON333 
(Managerial Economics). Out of these ten, seven will be oversized sections, 
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while only three would be small or regular sized classes. This combined with 
the fact that oversized sections usually have the capacity of at least twice 
as much as a small class, students may indeed have to "seek out" these scarce 
seats in the small sections. 

Based on the selection-bias-corrected estimates (columns (2) and (3) in 
Table 3) , the observed learning difference was once more decomposed. After 
accounting for the potential "overachiever" (or at least higher motivated) 
nature of students in the small class, the following was found: 

0.5812 - G - R + E - 0.3530 + 0.2282 
The "class size" effect (i.e. the term "R" ) dropped significantly from 0.5243 
(or more than 90% of the observed difference) to 0,3530 (or approximately 
61%). The t statistics also decreased to 5.35 (relative to the 13.10 under 
the OLS estimate), but still statistically significant at the 1% level. 

In other words, though the potential higher self-motivation of the students 
in the small class (which implies that these same students will still perform 
adequately even if they were in a large class setting) does explain 
significantly why the small section out-performed the larger section, however, 
a significant portion of the difference remain unexplainable and are 
attributed to the effect of class size. 

Conceptually, if one compares the OLS estimated "class size" effect of 
0.5243 with its counterpart of 0.3530 after the selection bias was corrected 
(i.e. accounting for the potential "overachiever" nature of the small class), 
the difference may be loosely interpreted as the effect of this higher 
motivation of the small section students. In other words, the superior 
motivation of the students account for 0.1713 or 29.5% of the apparent 
learning difference. 
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Summary and Conclusion 

Due to budgetary problems and a host of other factors, a national trend of 
moving towards larger classes seem seductive and inevitable. Oversized 
classes has been a rather common for lower division courses for quite some 
time. The new tendency is to extend this practice into intermediary or even 
upper division class. This paper attends to use rigorous econometric methods 
to analyze the effect of different class size settings on the students' 
learning for a .calculus based intermediate level economics class. 

As expected, the small class students performed significantly better than 
their large class counterpart in a set of standardized quizzes/exams. This 
observed difference was then decomposed into an explainable portion and a 
residual. This latter portion can be interpreted as the learning disparity 
caused by the difference in the class setting. It was found that only 10% of 
the difference can be explained by observable heterogeneity between the 
students of the two classes, leaving 90% of the difference as due to class 
size. Even after accounting for the fact that those students who self -select 
themselves into the small class might be potential overachievers (or at least 
more self-motivated and disciplined), 61% of the learning disparity remain 
unexplained. 

In conclusion, a disparity in learning is evident in favor of the smaller 
class. Though the size of this disparity is smaller (roughly 61%) than the 
observed difference, it is sizable and statistically significant (at a 1% 
significance level). Serious thoughts need to be devoted in the direction 
resource management. On the one hand, larger class sizes have its obvious 
economic efficiency in terms of its cost effectiveness, but on the other hand, 
it is our duty as educators to provide the beat learning environment possible 
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for our students. 

Whether this "cost" of lower learning capacity is enough to outweigh the 
"benefit" of savings in operation budget is not the objective of this study, 
nor is it within the ability of this author. This study is merely designed to 
provide the best possible measure of the effect of class size on student 
learning, given the various constrains the data poses. 

On this note, it is prudent to remind the readers once again that due to 
the size of the sample, the robustness of some of the findings are suspect. 
However , the objective of this study is to demonstrate the methodology and 
model than can and should be used in such studies. It is meant to motivate 
further research in hope of serving as a demonstration. In light of this, 
sufficient empirical evidence should have been presented. 

In addition to expanding the size of the study to test the robustness of 
the findings here, further research can move in several directions. For 
example, it would be of great interest to compare the magnitude of the class 
size effect between different levels of economics courses. In other words, 
have a minimum of one large and one small classes for two distinct level of 
classes (e.g. a pair of principles, versus a set of intermediate). It might 
well be that the class size effect is much smaller , or even negligible for 
lower division principles courses. Another extension is to extend such 
studies into different disciplines. There is no reason to expect just because 
their is a significant class size effect in the economic disciple there is a 
comparable difference in, say, political science. 
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Footnotes 

1 Craig, O'Neill, and Elfner (1979) found that class size did not affect 
student retention of knowledge. 

2 There are a handful of surveys (e.g. TUCE and the revised TUCE) that attempt 
to generate these measures but are generally plagued by many problems. 

3 I benefited greatly from the study conducted by Jane Lopus and Nan Maxwell 
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at the California State University, Hayward. 

4 The two classes are pooled together. The mean and standard deviation are 
calculated. Based on these, the grades are converted into a standard normal 
distribution. 

5 Refer to Heckman (1976) and Lee (1978) . For a more recent discussion in 
terms of "discrimination" analysis, see Hou (1991, 1993ab) . 

6 This selectivity bias is different from the one raised by Montmarquette and 
Houle (1986) . Their concern was the bias rising from the "sample" 
selection, while the issue here is the bias stemming from the student' s 
endogenous self -selection of class size. 

7 Peterson (1992) provides a condensed and informative description of the 
source of the selection bias. For a full discussion, refer to Heckman 
(1976) or Lee (1978) . 

8 In the survey questionnaire, the students were asked whether they preferred 
large or small classes. 

9 Though unbiased, these OLS estimates are less efficient due to hetero- 
scedasticity between ng j_ and n^ . To gain full efficiency , weighted least 
squares should be used in place of OLS. Conventionally, the literature 
ignores this heteroscedasticity and I will follow suit. 

10 This could, alternatively, be due to explicit differences in "effort" 
between the students of the two classes (Borg, Mason, and Shapiro 1989) 
rather than (or in addition to) unobservable traits of "overachievers" , 

11 But slightly (but not statistically significant) worse in the "post-test". 

12 The "selectivity effects" are simply os\s and aiXi (refer to equations (9) , 
( 10) , and (11) ) . A "positive" self-selection occurs when the selectivity 
effect is positive. This implies that the students are choosing the 
classes based on their relative comparative advantage, i.e. those students 
that would do better in a small class setting choose to enroll in the small 
class. 
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Table 1 Summary Statistics of Selected Variables 
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Standard deviation in parentheses. 

•>v ? ** t *** denote statistically significant difference between 
the two classes at the 10%, 5%, and 1% level, respectively. 

Notes: a) GPA(R) and GPA(A) refer to the student's self-reported 
and actual (obtained from the Registrar) GPAs . 

b) Principles of Microeconomics 

c) # of units taken in the Fall semester of 1992. 

d) # of hours of work per week (conditional mean) . 
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Standard errors in parentheses. 

* t *** denote that the coefficient is statistic- 

ally significant (2-tailed t test) at the 10%, 5%, 
and 1% level, respectively. 
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and 1% level, respectively. 
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Appendix: Student Questionnaire 



You are being asked 'to participate in an important pilot experiment to 
evaluate the determinants of teaching and learning, aimed at developing 
methods to improve the teaching of Managerial Economics. All your responses 
will be kept strictly confidential. The actual use of this survey will not be 
until the sta rt of the next semester (after the course grades are given ) at 
the earliest. In order to obtain some crucial information, we also need your 
authorization to access your student records at the University Registration. 
This will also, of course, be implemented next semester . Once the records are 
matched, all traces of your personal identity will be destroyed. Your 
complete anonymity is assured. Based on the methodology, it should be clear 
that your grades in this course will not be affected in any fashion. Your 
cooperation is greatly appreciated. 

Yes , I authorize permission to access my student records for 
analytical purposes only. 

Signature : 



1 . Name (print) : 

(Last) (First) Middle 

2. Student I.D. # 

3. Gender (circle one): Male Female 

4. Ethnic Origin (circle 1): White, Black, Hispanic, Asian, 

Other (specify: ) 

5. Date of Birth 

6. Were you born in the U.S.? Yes No If no, how old were you when you 
(and/or your family) moved to the U.S.? /19 . Is English your native 
language? Yes No ~~ ~~ 

7. Are you married? Yes No If yes, number of children younger than 6: 

8. Have you taken a high school economics course? Yes No. 
If yes, how long was it? 1 year, 1 semester, shorter. 
Was it mainly Macro, Micro, Both? 

9. What type of high school did you attend: Public, Private. 
If private, was is Catholic/Christian? Yes No. 

10. Between graduation from high school and now, how long have you been out of 
school? 

11. Have you taken the Scholastic Aptitude Test (SAT)? Yes No 

If yes, what was your score? ~~ " ~ ~ 
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12. Have you taken the American College Test (ACT)? Yes No 

If yes, what was your score?. 

13. Are you a (circle one): Sophomore, Junior, Senior, 

Other (specify: ) 

14. Are you a transfer? Yes, No. If yes, from a Junior College, another 4- 
year college/university ( ^ 

15. What is your declared major (Accounting, Economics, etc.): 

If undeclared, what is your planned major: 



16. What is your current Grade Point Average (GPA) : 

17. When did you take ECON201: G f 19 (with a grade of ) 

When did you take MATH115b: G f 19 (with a grade of ) 

18. Have you taken ECON333 before? Yes, No. If yes, when: Q f 19 

19. How many economics courses (other than 201, 202, 333) have you taken? 

20. How many calculus courses have you taken: 

21. How many units of college course work are you taking this semester? 

22. Are you working at a job(s) in addition to taking college courses? 

Yes, No. If yes, how many hours a week do you work? . is your work 

related to your major? Yes, No. 

23. Everything else equal, would you prefer a large or small class? Large, 
Small, Indifferent. Why: 

24. What is your main consideration in choosing which EC0N333 section to take? 
Da y < )- Time ( ). Class Size( Instructor ( ) 

25. How often do you read the Wall Street Journal, or the financial pages of 
major newspapers? Daily, Frequently, Occasionally, Seldom/Never. 

How often do you read Forbes/Fortune/Businessweek (etc.), or other trade 
journals? Frequently, Occasionally, Seldom/Never. 

How often do you read Times/Newsweek, etc.? Frequently, Occasionally, 
Seldom/Never. 

26. Typically where do you sit in the classroom? Front Middle Back 

27. Family background: 

Father's education , occupation 



Mother's education , occupation 

Number of children in your family you are * (1 for oldest, 2 

for 2nd oldest, etc.) . 
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